arXiv:astro-ph/9706258vl 25 Jun 1997 


Bayesian Analysis of the Polarization of Distant Radio Sources: 

Limits on Cosmological Birefringence 

Thomas J. Loredo, Eanna E. Flanagan, and Ira M. Wasserman 
Department of Astronomy, Space Sciences Building, Cornell University, Ithaca, New York 14853 

(22 June 1997) 

A recent study of the rotation of the plane of polarization of light from 160 cosmological sources 
claims to find significant evidence for cosmological anisotropy. We point out methodological weak¬ 
nesses of that study, and reanalyze the same data using Bayesian methods that overcome these 
problems. We find that the data always favor isotropic models for the distribution of observed 
polarizations over counterparts that have a cosmological anisotropy of the type advocated in the 
earlier study. Although anisotropic models are not completely ruled out, the data put strong lower 
limits on the length scale A (in units of the Hubble length) associated with the anisotropy; the lower 
limits of 95% credible regions for A lie between 0.43 and 0.62 in all anisotropic models we studied, 
values several times larger than the best-fit value of A « 0.1 found in the earlier study. The length 
scale is not constrained from above. The vast majority of sources in the data are at distances closer 
than 0.4 Hubble lengths (corresponding to a redshift of ssO.8); the results are thus consistent with 
there being no significant anisotropy on the length scale probed by these data. 
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Nodland and Ralston [|| recently analyzed the distribution of the intensity-averaged polarization angles of 160 
galaxies and claim to find significant evidence for cosmological birefringence—a systematic tendency for the plane 
of polarization of light to rotate in a manner dependent on the direction of travel and proportional to the distance 
traveled. The magnitude of the claimed effect is large; the predicted cosmological polarization angle rotation for a 
source at a Hubble distance can be as large as several radians, depending on the direction to the source. Such a 
discovery would be of immense importance if true. However, recent analyses of other data better suited to testing 
the Nodland-Ralston hypothesis find no evidence for cosmological birefringence and convincingly demonstrate that, 
if present, it must have a magnitude at least 30 to 100 times smaller than that claimed by Nodland and Ralston 

These analyses leave open the issue of accounting for the effect Nodland and Ralston claim to detect in the data 
set they analyzed. We argue here that the effect is not present and that methodological weaknesses corrupted their 
study (some similar points have been made in |^,^), and we demonstrate this by reanalyzing the same data in the 
framework of Bayesian inference where the weaknesses we identify are automatically dealt with and overcome. Our 
analysis finds no evidence for cosmological birefringence, and constrains the length scale for any possible effect to be 
at least several times larger than the length scale estimated by Nodland and Ralston. We thus find the original data 
to be completely consistent with the more recently analyzed data |^,^. 

The data in question consist of four quantities for each of 160 radio galaxies and was compiled by Carroll, Field, 
and Jackiw from previously published observations Q. The quantities are: Xii the intensity-averaged polarization 
angle for the radio emission from galaxy z, with the effects of Faraday rotation removed; ipi, the position angle for the 
galaxy describing the orientation of its image on the sky; rii, the direction to the galaxy; and Zi, the redshift of the 
galaxy. Nodland and Ralston posited that the polarization vector of light from a source at redshift z along direction 
n is rotated by an angle (3 given by 


P{z,n) = ^p{z)n-s, (1) 

where p{z) is the luminosity distance to a source at redshift z in a flat universe in units of the Hubble distance, A is a 
length scale (also in units of the Hubble length) determining the magnitude of the effect, and s is a fiducial direction 
associated with the effect. They used a linear correlation statistic to search for correlations between two quantities 
derived from the data for each object: a signed rotation angle, f3^, measuring the rotation of polarization for galaxy 
i 0, and p{zi)ni ■ s, the distance to the galaxy, projected along a candidate fiducial direction s. They determined 
the significances of correlations in 410 candidate s directions using two different procedures involving simulated data 
based on the null hypothesis of no correlation between Xi and any other galaxy parameters. Our criticisms of their 
analysis fall into three categories. 

First, their choice and use of statistics is ad hoc to a troubling degree. To be sure, there are no general criteria in 
frequentist statistics guiding the choice of statistic in realistically complicated problems, and in this sense the results 
of any study will be subjective. But the Nodland and Ralston choices of the linear correlation statistic R using the 
signed angles pf creates apparent correlations when none exist (as they themselves acknowledged). Their calibration 
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using simulated data ideally should account for this, but one might hope better choices are available that do not create 
spurious effects by construction. In particular, since Xi ^nd ipi are axial variables (angles that take on values in [0, tt)), 
the rotation from tpi to Xi is known only modulo tt and is thus ambiguous. Nodland and Ralston choose a priori to 
resolve the ambiguity in the manner that most favors birefringence. Also, their choice of the linear correlation statistic 
seems less than optimal considering that one of quantities correlated is an axial variable, and in some sense should 
be considered to “wrap” with a period of tt. Finally, they use the correlation statistic both for hypothesis testing and 
for parameter estimation. There is an accessible literature on the statistics of directional data that provides more 
appropriate choices |^, particularly for parameter estimation, for which Nodland and Ralston use a specific model 
that is amenable to analysis with standard estimation techniques. 

Second, their choice of null hypothesis is inappropriate. Others have independently made this criticism [Ql^]. Their 
Monte Carlo simulations created data with Xi ^tnd i/'i chosen independently and randomly over [0,7r). However, this 
is not the only possible null hypothesis without birefringence, nor is it a particularly interesting or realistic one. One 
could detect birefringence only if Xi s-nd tpi were correlated in the absence of birefringence; one must presume that 
'ipi determines a preferred direction for Xii with cosmological birefringence causing a rotation away from this. It is 
only because a high degree of correlation is both observed and expected between Xi and tpi that various investigators 
have used this data to search for evidence of birefringence. By their choice of null hypothesis Nodland and Ralston 
essentially set up a “straw man” whose near-certain demise need not imply the existence of birefringence. 

The most appropriate null hypothesis is thus one that correlates Xi and ipi with each other (but not with direction 
or redshift). The existing literature offers a number of possibilities. For example, several studies of the polarization 
of light from radio galaxies explicitly point out that there appear to be two populations of souces, a population 
with the polarization nearly perpendicular to the galaxy orientation, and a population with the polarization not 
very correlated with galaxy orientation, though with a possible weak preference for parallel orientation. These facts, 
suggesting a reasonable and simple non-birefringent null hypothesis, were clearly summarized by Carroll et al. in 
their presentation of the catalog analyzed by Nodland and Ralston. Unfortunately, the manner in which Nodland 
and Ralston resolve the modulo tt ambiguity in the polarization rotation maps the data only to the first and third 
quadrants in the , p{zi)ni ■ s) plane, forcing their linear correlation statistic to consider only lines through the 
origin, i.e., to presume that the polarization vector should be aligned with the galaxy major axis in the absence of 
birefringence {(3^ is presumed to vanish at low redshift, an observation also made in |^). Since the data are known 
to prefer perpendicular alignment, a large correlation must result, due not to a real correlation of rotation angle with 
redshift, but rather to a poor choice of statistic and null hypothesis. Even a real correlation between rotation angle 
and redshift need not imply birefringence. As Carroll et al. noted |^, the perpendicularly aligned population consists 
of the more luminous galaxies, and is thus visible to higher redshifts than the broader, possibly parallel-aligned 
population. Thus the typical alignment angle could vary with redshift simply due to the selection effect that luminous 
galaxies are visible to larger redshifts, so that perpendicularly aligned sources appear preferentially at large redshift. 
Any claim of evidence for birefringence must take such reasonable “null” isotropic models into account. If there is a 
significant correlation, assessment of such a hypothesis could require explicit inclusion of the galaxy luminosities in 
the analysis. 

Finally, our third criticism of the Nodland-Ralston analysis is that in their main Monte Carlo procedure they 
incorrectly assessed the significance with which they rejected the null hypothesis. They find the significances associated 
with each of 410 candidate s directions, and report the smallest value as their main result (a probability for falsely 
rejecting the null—the Type I error probability—of < 10“^). This value fails to account for the fact that 410 directions 
were searched because the value of s is uncertain a priori. Although the results of the analyses for all 410 directions 
are unlikely to be independent, the actual Type I error probability is certainly many times larger than the value they 
report. The second procedure they perform more appropriately accounts for the size of the parameter space they 
searched by performing a similar search for each simulated data set. It is in fact the only reasonable procedure of the 
two they use. However, they apply it only to subsets of the data at low and high redshift, finding a less significant 
departure from the null hypothesis for the high redshift subset and no significant departure from the null for the low 
redshift subset, yet concluding these results corroborate their earlier analysis (presumably because the data sets are 
smaller so only smaller departures are expected). The significance found by application of their second procedure to 
the entire data set would be by far the most interesting quantity they could calculate with their approach, but it 
remains curiously absent from their analysis. 

Many of these problems could be overcome with a careful frequentist analysis of these data; some are addressed 
in the recent analysis of Carroll and Field ||^ disputing the Nodland and Ralston results. Here we instead pursue 
a Bayesian approach to the problem |^. It has the virtue of circumventing many of the problems cited above in a 
largely “automatic” fashion. We thus intend our study not only to address the scientific question of the existence of 
evidence for birefringence, but also to offer a new tool for the study of polarization angle data that we believe to be 
superior to existing tools in many respects. 

Data enter a Bayesian analysis through the likelihood function—the probability for the data given some hypothesis 
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or model, M, which in general may have some unknown parameters V. We presume that, once a model is specified, 
the joint probability for the data for all sources is simply the product of individual probabilities for a single source’s 
data, _p(xi, Ui, Zi \ V, M). We can factor this probability as follows: 


p{Xi,'tpi,ni,Zi \V,M)= p{xi I ipi.rii, Zi,!^ , M)p{il)i,ni, Zi \ V,M). 


( 2 ) 


If we were interested in comparing hypotheses that explicitly modeled the distribution of source positions in space, 
or that sought to detect anisotropy in the distribution of source directions or orientations, the last factor would be 
important. Instead, we here focus only on isotropy evident in polarization angle rotation, as did Nodland and Ralston. 
Accordingly, we can ignore the last factor. 

Our task thus becomes one of specifying models that allow us to calculate the individual source likelihood functions, 
Ci(V) = p{xi I V’o V, M). The full likelihood is just the product of all Ci{V) functions; C{P) = Hi Note 

that we must specify a probability for the actual datum, Xi (given ijji, rii, and Zi), and not for a derived quantity such 
as the signed rotation introduced by Nodland and Ralston in an ad hoc attempt to resolve modulo tt ambiguities. 
As will become apparent, our approach lets the data determine which possible resolution of the ambiguity is most 
probable for each object and each choice of hypothesis. 

We now must specify choices for Ci. One choice is to take p{xi \ piyUii, Zi,V, M) to be a flat distribution, so 
that Ci = I/tt; this is the null hypothesis considered by Nodland and Ralston. We consider this uncorrelated 
hypothesis ourselves, but we must specify more; no Bayesian calculation can assess a “null” hypothesis without specific 
consideration of one or more alternatives. This may appear to be a serious drawback of the Bayesian approach, since 
tests like that used by Nodland and Ralston appear capable of rejecting a null hypothesis without making subjective 
choices of alternatives. But this apparent advantage of the frequentist hypothesis testing approach is largely illusory. 
Subjectivity enters the frequentist test in the choice of a test statistic. Statisticians have long appreciated that the 
choice of a particular test statistic can often be considered to represent an implicit choice of a particular class of 
alternative hypotheses, in the sense that a Bayesian comparison of the null hypothesis to that class of alternatives 
results in use of the same statistic used in the frequentist test (similar conclusions apply in a maximum likelihood 
setting). We thus exchange subjectivity in the choice of statistic for subjectivity in the choice of models. This has 
the virtue of making the assumptions underlying one’s analysis explicit, which in turn can guide the specification of 
further models that usefully generalize the analysis. 

Jeffreys ||l^ showed that Bayesian inference with conditional distributions that are Gaussian and correlated along 
straight lines leads to use of the linear correlation statistic. Since we seek to mimic in a Bayesian setting the analysis 
Nodland and Ralston attempted in a frequentist setting, their use of the correlation statistic might suggest that we 
introduce straight-line models and Gaussian errors. However, we are not free to introduce a priori the signed rotation, 
Pf, they used in their correlation analysis; any model we choose must specify a probability for the actually observed 
“raw” data. Also, since Xi is an axial quantity, we cannot use a Gaussian distribution for it, but must instead use an 
axial distribution that recognizes that Xi is restricted to the interval [0, tt). Since no correct Bayesian analysis of these 
data can lead precisely to their choice of statistics, we instead seek to mimic the spirit of their analysis, introducing 
linear models where appropriate and using distributions that generalize the Gaussian to angular variables. 

Our models presume that Xi = i'Pi+Si) modTr, where 0i is the actual angle of rotation of the polarization vector from 
the object direction ipi, including a possible birefringent term. The probability distribution for 0i is a circular (not 
axial) distribution (i.e., its domain is [0, 27r)) that can depend on and Zi. From this assumption it is straightforward 
to derive the axial probability distribution for the observable Xi from any hypothesized distribution f{0i) for the 
rotation angle 0i: 


CPV)=f{Ap+f{A, + 7r), (3) 

where 

* \ — {ipi — Xi)t otherwise. ^ ' 

The quantity is similar to pf^ of Nodland and Ralston. But it was derived from an explicit model for the polarization 
rotation rather than specified ad hoc, and the likelihood factor for each galaxy explicitly accounts for the two possible 
resolutions of the modulo tt ambiguity associated with Xi ■ 

To complete specification of a model, we must specify a distribution for the rotation angle 0i. To follow the spirit 
of the Nodland and Ralston analysis, we construct candidate distributions using a well known circular generalization 
of the Gaussian distribution, the Von Mises distribution |^. The Von Mises distribution for an angle 0 concentrated 
at angle Oq with concentration parameter k is 
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( 5 ) 


2ttIq{k) 

where the normalization constant requires evaluation of a modified Bessel function, Io{k). When k 3> 1 this distribu¬ 
tion becomes approximately Gaussian, with a mean of Oq and a variance of 1/k. 

For our simplest nontrivial model, expressing a possible relationship between Xi and ipi in the absence of birefrin¬ 
gence, we take f{9i) to be a Von Mises distribution with unknown mean 9m and concentration k; both parameters will 
be estimated using the data. We denote this model Mi. Using equation (^), this corresponds to an object likelihood 
function Ci{9m, k) = g{Ai; 9m, k), where 


g{9-,9o,K) 


cosh [k cos{(j) — (/>o)] 
27r/o(K) 


( 6 ) 


Contours of the resulting joint posterior distribution for 9m and k appear in Fig. 1; Table 1 lists the most probable 
parameter values. Here and throughout this paper, posteriors are found by multiplying the likelihood by prior 
distributions that are fiat with respect to the plotted axes, and normalized over the range plotted. In this case, the 
prior is flat over 9m and flat over the logarithm of k (the standard scale-invariant prior for an a priori unknown nonzero 
scale parameter). 

As is clear from the figure, the data strongly favor polarization angles that are oriented perpendicular to the 
galaxy’s major axis, as one might have anticipated from visual inspection of the data [^. The distribution of angles 
is moderately concentrated, with the mode at k = 2.0 (corresponding to an angular half-width « 40°). Note that 
distributions that preferentially align Xi with ^/jj, as was implicitly presumed in the Nodland and Ralston analysis, 
are strongly ruled out. It is also evident from this figure that the null hypothesis of no relationship between Xi 
'i/'i (corresponding to k = 0) is strongly ruled out. In the Bayesian framework we can quantify this by calculating 
the posterior odds in favor of the Von Mises model Mi over the null hypothesis (i.e., the ratio of their posterior 
probabilities). The posterior odds is the product of a subjective prior odds factor, and a data-dependent term called 
the Bayes factor. The Bayes factor is the ratio of the global likelihoods for the competing models, where the global 
likelihood for a model with free parameters is found by averaging the likelihood for its parameters (weighted by the 
prior distributions for the parameters). This is in contrast to frequentist hypothesis testing methods which instead 
extremize over parameter space (e.g., maximum likelihood or minimum y^). The averaging required by the Bayesian 
approach implements an automatic “Ockham’s razor” that penalizes models for the size of their parameter space. As 
a result, even though a complicated model may fit the data better than a simpler model in the sense of having a 
larger maximum likelihood, it can be less probable than the simpler model if the likelihood is not increased enough 
to account for the larger size of the parameter space of the more complicated model. It is thus customary in Bayesian 
literature to report Bayes factors for model comparison, essentially presuming the prior odds to be unity (since model 
complexity is already incorporated into the Bayes factor). If one judges certain models to be significantly less probable 
than others a priori (on grounds other than their complexity), one can multiply the reported Bayes factor to account 
for this. 

For the null hypothesis, Ci = I/tt, and there are no free parameters; we denote this model as Mq. The likelihood 
for this model is thus simply 1 / tt ^ , where for the full dataset N = 160. For the Von Mises model, Mi, one must 
average the numerically calculated likelihood over 9m and k. The resulting Bayes factor is 1.9 x 10'^ in favor of Mi 
over Mq, conclusively favoring a correlation between Xi and ipi over the null hypothesis of no correlation. 

We now introduce cosmological birefringence into the Von Mises model by setting Ci{9m,n) = g{Ai\9m + Pi, n), 
where the expected angle of rotation for object i is shifted by Pi = P{zi, n^) (c.f., eqn. (|^)), as posited by Nodland and 
Ralston. This adds three parameters to the model, the dimensionless length scale A, and the unknown fiducial direction 
s (specified by right ascension a and declination 6). We denote this model as M{. We use fiat priors for a and sin((5) 
and a fiat prior for log(A) with O.OI, < A < 10 (corresponding to distance scales appropriate for detecting a cosmological 
effect; i.e., at least a reasonable fraction of a Hubble length, up to the length scale of the observable universe). The 
solid curve in Fig. 2 shows the resulting marginal posterior distribution for A. This marginal distribution p{X \ D, M[) 
is found by numerically integrating the full joint posterior distribution p{X, a, S,9m, k \ D,M[) over the remaining 
four parameters a, 6, 9m, k, and thus accounts for parameter uncertainties and correlations between all parameters 
|p4| . For A > I, the marginal posterior is very nearly flat, i.e., it resembles the prior. In other words, in this region 
the data have taught us nothing—they are uninformative about birefringence on these length scales, as one might 
have guessed a priori since there are few galaxies in the data set with z > 1. But in the A < I region, the data are 
very informative. The 95.4% (“2 (t”) highest posterior density credible region (CR) begins at A = 0.43 and extends 
to the upper limit of the prior range. Birefringent models with A ^ 0.4 are thus conclusively ruled out. The best-fit 
model of Nodland and Ralston had A ~ 0.1. 

The Bayes factor comparing this model to its simpler isotropic counterpart is 0.46. Though slightly favoring 
the isotropic model, this Bayes factor indicates that the data do not conclusively discriminate between birefringent 
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and isotropic Von Mises models. However, the allowable birefringent models are those about which the data are 
uninformative (those with A ^ 1). The models with small A favored by Nodland and Ralston are conclusively 
rejected. 

Finally, we note that the best-fit values of the parameters for the underlying polarization angle distribution are 
little changed from their best-fit values in the isotropic model (see Table 1). Thus the preference for perpendicular 
alignment is not a consequence of ignoring possible birefringence. The best-fit fiducial direction is at a = 149° and 
S = 11°, very different from the Nodland and Ralston best-fit direction of a = 315° and 6 = —10°. However, the 
posterior varies only weakly with s, so the fiducial direction is highly uncertain and can lie almost anywhere on the 
sky. This is consistent with there being negligible evidence for birefringence in these data. 

We also considered models that are more complicated than those implicit in the Nodland and Ralston analysis. 
These models were motivated by the possibility that there are two populations of sources, one with polarization 
preferentially perpendicular to the source orientation (as was infered in the model analyzed above), and another with 
polarization only weakly aligned, with a possible preference for parallel alignment. Accordingly, we considered models 
that superposed parallel and perpendicular populations by taking 

Cii9m, k) = fgiX; 0, K||) + (1 - /) 5 (A,; 7r/2, k±), (7) 

where / is the fraction of sources in the parallel population, and and are the widths of the parallel and 
perpendicular populations. We studied this model and two simpler cases of it: a model combining the parallel 
population with a population with a flat distribution (i.e., fixing k± =0), and a model combining the perpendicular 
population with a flat A^ distribution (i.e., fixing K|| =0). Of these three classes of models, only the latter was favored 
by the data over the single population model discussed above (regardless of whether birefringence was added or not). 
We here discuss results only for this model, which we denote M 2 . 

Model M 2 has two parameters, / and k±. Fig. 3 shows contours of the joint posterior for these parameters. Table 1 
lists the most probable values. There is a clear mode corresponding to roughly half of the sources coming from a flat 
population and the remainder coming from a perpendicular population that is moderately concentrated with k_l « 4 
(corresponding to a half-width of w 30°). But the posterior has a tail extending to low / with / = 0 lying within the 
95.4% CR, suggesting that the data do not decisively prefer this model to a single population model. The Bayes factor 
comparing this model to the single population model discussed above is 54, indicating significant but not conclusive 
evidence for the addition of a flat population. 

Next we considered a birefringent version of this model, M^, by taking 

A(6'm, k) = - + {I - f)g{Ai + [3i; ^12, ki_). (8) 

TT 

This model has five parameters: /, k±, A, and s = {a, <5}. The Bayes factor comparing this model to its counterpart 
without birefringence is ~ 0.44, again indicating that though the data prefer models without birefringence, they do 
not conclusively rule it out. However, as with single component models, small values of A are ruled out, as is apparent 
from the marginal distribution shown as the dashed curve in Fig. 2 (the lower limit of the 95.4% CR is at A = 0.52). 
Again, values of the order of that found by Nodland and Ralston are excluded with high probability. 

Following Nodland and Ralston, we repeated the analysis using only the subset of 71 sources with z > 0.3, the 
redshift where they claim the birefringence effect “turns on.” Table 2 lists the resulting parameter estimates and 
Bayes factors. None of the conclusions found in our analysis of all 160 sources was changed. Bayes factors again 
indicate that isotropic models are prefered to their birefringent counterparts, though not decisively so. The credible 
regions for A again decisively rule out the Nodland and Ralston value; the lower limit of the 95.4% CR is at A « 0.47 
for the single component model M(, and at A « 0.62 for model combining a perpendicular population with a flat 
population. Note that the estimated fraction in the flat polulation is significantly smaller than was the case in the 
analysis of all 160 galaxies. This may support the observation of Carroll, Field, and Jackiw ||] that the sources with 
polarization perpendicular to the galaxy orientation tend to have higher luminosities (and thus dominate the sample 
at high redshifts). 

We emphasize that these results have been obtained using the same data analyzed by Nodland and Ralston, with the 
same explicit model for birefringence, and with models for the underlying relationship between Xi tpi as close as 
possible to what was implicitly presumed in their analysis. Our findings reveal the data in question to be completely 
consistent with more recently analyzed data that constrain any cosmological birefringence to be much smaller in 
magnitude than that claimed by Nodland and Ralston ||,^. The primary difference between the Nodland-Ralston 
study and ours is methodological. Our methodology improves on the frequentist one adopted by Nodland and Ralston 
in all three categories mentioned above. Our choice of statistics is dictated by our choice of models via the rules of 
probability theory and rigorously accounts for the axial nature of the observed quantities, and in particular for modulo 
TT ambiguities. We considered a variety of hypotheses with and without birefringence, and thus have not fallen prey 
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to claiming detection of birefringence by rejecting an unnecessarily simple “straw man” null hypothesis. And finally, 
the Bayesian methodology fully and automatically accounts for the sizes of the parameter spaces of our models and 
for all correlations between inferred parameters, both in the averaging process underlying calculation of Bayes factors 
for comparing models, and in the integrations required in calculating marginal distributions for subsets of parameters 
of particular interest (e.g., A). Quite apart from the issue of the existence of birefringence, we believe these qualities 
strongly recommend Bayesian methods for the analysis of polarization angle data. They can be straightforwardly 
generalized to address questions beyond those dealt with in this brief paper. For example, uncertainties in the measured 
quantities can be straightforwardly accounted for by marginalizing. Also, more sophisticated models can be studied 
by including data on the polarization strengths and luminosities of sources in order to determine whether alignment 
is correlated with these quantities, and whether population parameters change with redshift in a manner that can be 
attributed to source evolution, or in a manner that instead demands the existence of cosmological anisotropy. 

We thank Sean Carroll for providing a machine-readable catalog of the polarization data published in [|j , and Borge 
Nodland for information about typographical errors in the catalog. This work was supported in part by NASA grants 
NAG 5-3427 and NAG 5-3097 and by NSF grants AST 91-19475, AST 93-15375, and PHY-9408378. 
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FIG. 1. Joint posterior distribution for 9m and logj^Q(K) in the single component model Mi, with no birefringence. Contours 
indicate the boundaries of 68.3% (dotted), 95.4% (dashed), and 99.73% (solid) credible regions. Cross indicates the mode. 


FIG. 2. Logarithm of marginal posterior distributions for the birefringence lengthscale A (in units of the Hubble distance), 
with arbitrary normalization. Dots mark the lower limits of 95.4% credible regions. The solid curve is for the single component 
birefringent model M[\ the dashed curve is for the two component birefringent model M 2 . 


FIG. 3. Joint posterior distribution for / and logig(«;x) in the two component model M 2 , with no birefringence. Contours 
indicate the boundaries of 68.3% (dotted), 95.4% (dashed), and 99.73% (solid) credible regions. Cross indicates the mode. 
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TABLE I. Most probable parameter values and Bayes fac- 


tors; all data (160 galaxies) 

Quantity 

Isotropic 

Birefringent 


Ml, M[: Single Von Mises 

(°) 

94.6 

91.3 

logio K 

0.29 

0.31 


1.9 X 10'* 

8.5 X 10® 

logio 

- 

-0.15 


- 

149 

sin S 

- 

0.19 

M2,M^: 

Perpendicular Von Mises + Constant 

/ 

0.56 

0.55 

logio «-L 

0.81 

0.84 


1.0 X 10® 

4.5 X 10® 

logio ^ 

- 

-0.13 


- 

146 

sin S 

- 

0.23 

“^Bayes factors compare the models to model Mo positing a 
flat polarization angle distribution, uncorrelated with position 
angle. 


TABLE 11. Most probable parameter values and Bayes fac¬ 
tors; data selected with 2 > 0.3 (71 galaxies) 

Quantity 

Isotropic 

Birefringent 



Single Von Mises 

(°) 

93.7 

85.8 

logio « 

0.56 

0601 


2.4 X 10'^ 

1.3 X 10^ 

logio -I' 

- 

-0.25 

a(°) 

- 

155 

sin S 

- 

0.64 

M2,M) 

): Perpendicular Von Mises -I- Constant 

/ 

0.17 

0.051 

logio «-L 

0.71 

0.63 

B^ 

2.2 X 10® 

9.6 X 10'^ 

logio ^ 

- 

-0.12 


- 

155 

sin 5 

- 

0.38 


“^Bayes factors compare the models to model Mo positing a 
flat polarization angle distribution, uncorrelated with position 
angle. 










